Predictive Timing Models

نویسندگان

  • Pierre-Luc Bacon
  • Borja Balle
  • Doina Precup
چکیده

We consider the problems of learning and planning in Markov decision processes with temporally extended actions represented in the options framework. We propose to use predictions about the duration of extended actions to represent the state and show how this leads to a compact predictive state representation model independent of the set of primitive actions. Then we develop a consistent and efficient spectral learning algorithm for such models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Dynamic Modulus Predictive Models for Asphalt Mixtures

Dynamic modulus characterizes the viscoelastic behavior of asphalt materials and is the most important input parameter for design and rehabilitation of flexible pavements using Mechanistic–Empirical Pavement Design Guide (MEPDG). Laboratory determination of dynamic modulus is very expensive and time consuming. To overcome this challenge, several predictive models were developed to determine dyn...

متن کامل

A Predictive Model for the Combustion Process in Dual Fuel Engines at Part Loads Using a Quasi Dimensional Multi Zone Model and Detailed Chemical Kinetics Mechanism

This work is carried out to investigate combustion characteristics of a dual fuel (diesel-gas) engine at part loads, using a quasi-dimensional multi zone combustion model (MZCM) for the combustion of diesel fuel and a single zone model with detailed chemical kinetics for the combustion of natural gas fuel. Chemical kinetic mechanisms consist of 184 reactions with 50 species. This combustion mod...

متن کامل

Prediction of emergence of Flixweed (Descurainia sophia) and Wild Oat (Avena fatua) using thermal time models in Winter Rapeseed (Brassica napus)

A thermal time (TT) model was developed to simulate field emergence of two weed species (flixweed and wild oat) in winter rapeseed. Practical predictive weed emergence models can provide information about timing of weed emergence. Non-linear regression models are usually able to accurately predict field emergence under specific environmental conditions. In the present study, cumulative seedling...

متن کامل

Spike timing dependent plasticity: mechanisms, significance, and controversies

Long-term modification of synaptic strength is one of the basic mechanisms of memory formation and activity-dependent refinement of neural circuits. This idea was purposed by Hebb to provide a basis for the formation of a cell assembly. Repetitive correlated activity of pre-synaptic and post-synaptic neurons can induce long-lasting synaptic strength modification, the direction and extent of whi...

متن کامل

A comparison of different network based modeling methods for prediction of the torque of a SI engine equipped with variable valve timing

Nowadays, due to increasing the complexity of IC engines, calibration task becomes more severe and the need to use surrogate models for investigating of the engine behavior arises. Accordingly, many black box modeling approaches have been used in this context among which network based models are of the most powerful approaches thanks to their flexible structures. In this paper four network base...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014